Coffee Visualization Project

By: Khrystyna Kubatska

Date: December 2021

Data description

Source: ICO Coffee Dataset
The selected dataset contains coffee stats worldwide by 13 different features from 1990 to 2018.

In the project, the following files from this dataset were used for visualizations:

Also, one additional file (world-countries.json) with polygon data was used for one of the visualizations.

Visualization 1

The visualization aim is to show how the imported coffee amount in each country has changed over the years.

The best solution for this visualization, in my opinion, is to display the data on a map using a geopandas library, encoding the amount of exported coffee in color. The color pallet is 'goldgreen': yellow displays lower values; the higher the green hue, the higher the value.
Tooltips with country, import, and year information are also added.
The map is interactive: a user can filter the data by years using a slider under the map.

The main advantage of the visualization is the convenience of viewing the whole picture in the world's changing periods.
Disadvantages: a relatively large numbers range (0 ~ 30,000) divided into a relatively small number of color ranges (step - 4000) -> Impossibility to compare values from the same interval without information from tooltips.

Visualization 2

The purpose of the visualization was to show changes in prices paid to coffee growers in exporting countries in 2018 based on the 1990 year.

For the visualization, I considered the following ways: bar chart (simple to understand), encoding by color on map (cons: data is available only for 10 countries, so there is no need to show all world map), radar and bubble chart (cons: encoding by size is less convenient than by length).

I selected the horizontal bar chart. Each column corresponds to a country. The X-axis is not shown to not overload the visualization; instead, percentage values are displayed on bars, which are sorted in descending orders. Y-axis is still shown since, in my opinion, it is easier to interpret the chart. Tooltips also are added.

The main advantage of the chart is its easiness of understanding and comparing values.

Visualization 3

The purpose of the visualization was to show payment changes for coffee growers in 2018 based on the 1990 year in USD/kg.

I considered the bar chart and line chart for the visualization, but I chose the dumbbell chart. Countries are located on the Y-axis, and prices for each country in 1990 and 2018 are located on the X-axis. Values for years vary by color. Countries are sorted in descending order by the largest prices. Also, tooltips with the country name, payments, and year are added.

The advantages of this visualization: convenient to compare values for countries with each other; easy to track changes; the ability to see specific data due to tooltips.

Visualization 4

The main visualization purpose is to display the ICO Composite and group indicator prices from 1990 to 2018.

For this visualization, I considered bar charts and adding interaction but selected a line chart because its main advantage is the easiness of interpretation.

Each line represents a type of coffee; the X-axis encodes years; Y-axis encodes prices indicators. There are tooltips with year, coffee type, and indicator price information.